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Abstract — Phase-change memory (PCM) is a promising non- 
volatile solid-state memory technology. A PCM cell stores data 
by using its amorphous and crystalline states. The cell changes 
between these two states using high temperature. However, since 
the cells are sensitive to high temperature, it is important, when 
programming cells, to balance the heat both in time and space. 

In this paper, we study the time-space constraint for PCM, 
which was originally proposed by Jiang et al. A code is called an 
(a, j3,p)-constrained code if for any a consecutive rewrites and 
for any segment of /3 contiguous cells, the total rewrite cost of 
the ft cells over those a rewrites is at most p. Here, the cells 
are binary and the rewrite cost is defined to be the Hamming 
distance between the current and next memory states. First, we 
show a general upper bound on the achievable rate of these codes 
which extends the results of Jiang et al. Then, we generalize their 
construction for (a ^ l,/3 = l,p = 1) -constrained codes and 
show another construction for (a = l,/3 > l,p > 1) -constrained 
codes. Finally, we show that these two constructions can be used 
to construct codes for all values of a, j3, and p. 



I. Introduction 

Phase-change memory (PCM) devices are a promising technol- 
ogy for non-volatile memories. Like a flash memory, a PCM 
consists of cells that can be in distinct physical states. In the 
simplest case, the PCM cell has two possible states, an amor- 
phous state and a crystalline state. Multiple-bit per cell PCMs 
can be implemented by using partially crystalline states J5). 

While in a flash memory one can decrease a cell level only 
by erasing the entire block of about 10 6 cells that contains it, 
in a PCM one can independently decrease an individual cell 
level - but only to level zero. This operation is called a RESET 
operation. A SET operation can then be used to change the 
cell state to any valid level. Therefore, in order to decrease a 
cell level from one non-zero value to a smaller non-zero value, 
one needs to first RESET the cell to level zero, and then SET 
it to the new desired level J5J. Thus, as with flash memory 
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programming, there is a significant asymmetry between the 
two operations of increasing and decreasing a cell level. 

As in a flash memory, a PCM cell has a limited lifetime; 
the cells can tolerate only about 10 7 — 10 8 RESET opera- 
tions before beginning to degrade fl3l . Therefore, it is still 
important when programming cells to minimize the number 
of RESET operations. Furthermore, a RESET operation can 
negatively affect the performance of a PCM in other ways. 
One of them is due to the phenomenon of thermal crosstalk. 
When a cell is RESET, the levels of its adjacent cells may in- 
advertently be increased due to heat diffusion associated with 
the operation ]5J, EH . Another problem, called thermal ac- 
cumulation, arises when a small area is subjected to a large 
number of program operations over a short period of time 151 , 
l2ll . The resulting accumulation of heat can significantly limit 
the minimum write latency of a PCM, since the programming 
accuracy is sensitive to temperature. It is therefore desirable 
to balance the thermal accumulation over a local area of PCM 
cells in a fixed period of time. Coding schemes can help over- 
come the performance degradation resulting from these phys- 
ical phenomena. Lastras et al. Ifl8l studied the capacity of a 
Write-Efficient Memory (WEM) |2j for a cost function that 
is associated with the write model of phase-change memories 
described above. 

Jiang et al. Ifl6l have proposed codes to mitigate thermal 
cross-talk and heat accumulation effects in PCM. Under their 
thermal cross-talk model, when a cell is RESET, the levels of 
its immediately adjacent cells may also be increased. Hence, 
if these neighboring cells exceed their target level, they also 
will have to be RESET, and this effect can then propagate to 
many more cells. In lfl6l . they considered a special case of 
this and proposed the use of constrained codes to limit the 
propagation effect. Capacity calculations for these codes were 
also presented. 

The other problem addressed in |fl6l is that of heat accu- 
mulation. In this model, the rewrite cost is defined to be the 
number of programmed cells, i.e., the Hamming distance be- 
tween the current and next cell-state vectors. A code is said to 
be (a, f3,p)-constrained if for any a consecutive rewrites and 
for any segment of /3 contiguous cells, the total rewrite cost 
of the (3 cells over those a rewrites is at most p. A specific 
code construction was given for the (a ^ = l,p = 1)- 
constraint as well as an upper bound on the achievable rate 
of codes for this constraint. An upper bound on the achiev- 
able rate was also given for (a = l,/3 ^ l,p = 1) -constrained 
codes. 

The work in lfl6l dealt with only a few instances of the 
parameters a, f3 and p. In this paper, we extend the code 
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constructions and achievable-rate bounds to a larger portion 
of the parameter space. In Section [TTJ we formally define 
the constrained-coding problem for PCM. In Section [HI] us- 
ing connections to two-dimensional constrained coding, we 
present a scheme to calculate an upper bound on the achiev- 
able rate for all values of a, (3 and p. If the value of a 
or j3 is 1 then the two-dimensional constraint becomes a 
one-dimensional constraint and we calculate the upper bound 
on the achievable rate for all values of p. This result coin- 
cides with the result in lfl6l for (a l,/3 = l,p = 1) and 
(q = 1,(3 ^ l,p = 1). We also derive upper bounds for some 
cases with parameters satisfying (a > 1, (3 > l,p = 1) using 
known results on the upper bound of two-dimensional con- 
strained codes. In Section |IV] code constructions are given. 
First, a trivial construction is given and we show an improve- 
ment for (a — 1, (3 ^ l,p ^ 1) -constrained codes and extend 
the construction in |[T6l of (a ^ l,/3 = l,p = 1) -constrained 
codes to arbitrary p. Finally, we show how to extend the 
constructions for all values of a, (3 and p. 

II. Preliminaries 

In this section, we give a formal definition of the constrained- 
coding problem. The number of cells is denoted by n 
and the memory cells are binary. The cell-state vectors 
are the binary vectors from {0,1}". If a cell-state vector 
u = (ui, . . . ,u n ) £ {0, 1}" is rewritten to another cell-state 
vector v = (vi, . . . ,v n ) £ {0,1}", then the rewrite cost is 
defined to be the Hamming distance between u and v, that is 

d H (u,v) = \{i : Ui ^ Vi, 1 < i < n}\. 

The Hamming weight of a vector u is wt(u) = dn(u,0). 
The complement of a vector u is u = (ui, ...,I n ), For a 
vector x = (xi, ■ ■ ■ , x n ), we define for all 1 ^ p ^ q ^ n, 
x p = ( x p> x p+i> ■ ■ ■ i x q)- The set {i, i + 1, . . . , j} is denoted 
by [i : j] for i ^ j, and in particular, {1,2,..., [2" fl J} is 
denoted by [1 : 2 nR ] for an integer n and real R. 

Definition 1. Let a, (3,p be positive integers. A code C satis- 
fies the (a, (3, p) -constraint if for any a consecutive rewrites 
and for any segment of f3 contiguous cells, the total rewrite 
cost of those f3 cells over those a rewrites is at most p, 
and C is called an (a, (3, p) -constrained code. That is, if 
Vi = (vi.i, . . . , Vi^ n ), for i ^ 1, is the cell-state vector on the 
i-th write, then, for all i ^ 1 and 1 ^ j ^ n — j3 + 1, 



{(k, £) : v i+ k,j+i + v i+k+ x,j + i, ^k<a,0^£<l3} 



We will specify (a, (3,p) -constrained codes by an explicit con- 
struction of their encoding and decoding maps. On the z-th 
write, for i ^ 1, the encoder 

Si : [1 : 2 nRi ] x {0, 1}" h-> {0, 1}" 

maps the new information symbol and the current cell-state 
vector to the next cell-state vector. The decoder 



maps the cell-state vector to the represented information sym- 
bol. We denote the individual rate on the i-th write of the 
(a, /3,p)-constrained code by Note that the alphabet size 
of the messages on each write does not have to be the same. 
The rate R of the (a, -constrained code is defined as 



R 



lim 

m— >oo 



E™ i Ri 



(i) 



Vi : {0, 1}" ^[1:2 



nRi 



We assume that the number of writes is large and in 
the constructions we present there will be a periodic se- 
quence of writes. Thus, it will be possible to change any 
(a, /3,p)-constrained code C with varying individual rates to 
an (a, /3,p) -constrained code C with fixed individual rates 
such that the rates of the two constrained codes are the same. 
This can be achieved by using multiple copies of the code C 
and in each copy of C to start writing from a different write 
within the period of writes. Therefore, we assume that there 
is no distinction between the two cases and the rate is as de- 
fined in Equation which is the number of bits written per 
cell per write. 

The encoding and decoding maps can be either the same 
on all writes or can vary among the writes. In the latter case, 
we will need more cells in order to index the write number. 
However, arguing as in 1261 . it is possible to show that these 
extra cells do not reduce asymptotically the rate and therefore 
we assume here that the encoder and decoder know the write 
number. 

A rate R is called an (a, f3, p)-achievable rate if there ex- 
ists an (a, (3,p) -constrained code C such that the rate of C is 
R. We denote by C n (a, /3,p) the supremum of all (a,(3,p)- 
achievable rates while fixing the number of cells to be n. The 
(a, (3,p)-capacity of the (a, -constraint is denoted by 
C(a, (3,p) and is defined to be 

C(a,(3,p) = lim C„(a,/3,p). 

n— >oo 

Our goal in this paper is to give lower and upper bounds on 
the (a, /3,p) -capacity, C(a,(3,p), for all values of a, (3, and 
p. Clearly, if p ^ a/3 then C(a,f3,p) = 1. So we assume 
throughout the paper that p < a(3. Lower bounds will be 
given by specific constrained code constructions while the up- 
per bounds will be derived analytically using tools drawn from 
the theory of one- and two-dimensional constrained codes. 

III. Upper Bound on the Capacity 

In this section, we will present upper bounds on the (a, (3,p)- 
capacity obtained using techniques from the analysis of 
two-dimensional constrained codes. There are a number of 
two-dimensional constraints that have been extensively stud- 
ied, e.g., 2-dimensional (d, fc)-runlength-limited (RLL) ifTTl . 
l23l . no isolated bits fl2l . fl4l . and the checkerboard con- 
straint l20l . l25l . Given a two-dimensional constraint S, its 
capacity is defined to be 

C 2D{S) = lim l0g2Cs(w ' n) , 

ra.n— s-oo rati 

where €5(771, n) is the number of m x n arrays that satisfy the 
constraint S. The constraint of interest for us in this work is 
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the one where in each rectangle of size a x b, the number of 
ones is at most p. 

Definition 2. Let a,b,p be positive integers. An (m x n) -array 

A = ( ffl i,j)i<i<m,i<j<n € {0, l} mxn is caJJed an (a, b, p)- 
array if in each sub-array of A of size a x b, the number of 1 's 
is atmostp. That is, for all 1 ^ i ^ m— a+1, 1 ^ j ^ n— 6+1, 

|{(M) : 0<A<a-l,0<^<6-l,Oi+ w+ « = 1}| <p. 
The capacity of the constraint is denoted by C 2 d (a, b, p). 

Note that when p = 1, the (a, a, 1) constraint coincides 
with the square checkerboard constraint of order a — 1 ll25ll . 

The connection between the capacity of the two- 
dimensional constraint C 2 D(a,b,p) and the (a, (3, p) -capacity 
is the following. 

Theorem 1. For alia, /3,p, C(a, /3,p) < G 2 n{a.,fi,p). 

Proof: Let C be an (a, /3, p) -constrained code of length n. 
For any sequence of m writes, let us denote by Vi, for i ^ 0, 
the cell-state vector on the z'-th write, where Vq is the all-zero 
vector. The to x 71-array A = (dij) is defined to be 

Oi,j = v i,j + v i-l,j> 

where the addition is a modulo 2 sum. That is, ajj = 1 if 
and only if the j-th cell is changed on the z-th write. Since C 
is an (a, /3,p) -constrained code, for all 1 ^ i ^ to — a and 

l<j<n-/3 + l, 

|{(M) : «i+fc,j+^ 7^ < fc < a,0 ^ £ < /3}| < p, 

and therefore 

\{(k,£) : 0<A<a-l,0<^<j8-l > a i+fclj+ / = l}| <p. 

Thus, A is an (a, /3,p)-array of size m x n. 

Every write sequence of the code C corresponds to an 
(a, /3,p)-array and thus the number of write sequences of 
length m is at most the number of (a, /3,p)-arrays, which 
is upper bounded by 2 mnC2Dl - a ^' p \ for m,n large enough. 
Hence, the number of distinct write sequences is at most 
2mnC 2D {a,0, P ) _ However, if the individual rate on the z'-th 
write is Rj, then the total number of distinct write sequences 



^ m 



We conclude that 



and, therefore, 



i=l 



rn 



^C 2D {a,f3,p). 



If to goes to infinity, the rate of any (a, /3,p)-constrained code 
R satisfies 

R^C 2D {a,/3,p), 

i.e., C(a,0,p) < C 2D {a,f3,p). ■ 
Theorem Q] provides a scheme to calculate an upper bound 
on the (a, (3, p) -capacity from an upper bound on the capac- 
ity of a two-dimensional rectangular checkerboard constraint. 



Unfortunately, good upper bounds are known only for some 
special cases of the values of ot,(3,p, and in particular, when 
p = 1. The checkerboard constraint has attracted considerable 
attention over the past 20 years and some lower and upper 
bounds on the capacity were given in l20ll . l24l . 11251 . For in- 
stance, some upper bounds for the square checkerboard con- 
straint are shown in fl25l , from which we can conclude that 
C(2, 2, 1) < 0.43431 and C(3, 3, 1) < 0.25681. 

In the rest of this section we discuss the cases where a = 1 
or /3 = 1 corresponding to one-dimensional constraints. 
First, let us consider the upper bound on C(a = l,(3,p). 
We use the one-dimensional (d, fc)-runlength-limited (RLL) 
constrained codes 1271 . where the number of 0's between ad- 
jacent l's is at least d and at most k. In fact, Jiang et al. lfl6l 
showed that the capacity of the (/3 — l,oo)-RLL constraint 
is an upper bound on C(l,/3,1), which is a special case of 
Theorem Q] The lowest curve in Fig. Q] shows the capacity of 
the (/3 — 1, 00) -RLL constraint. We extend the upper bounds 
to arbitrary p ^ 1. First, let us generalize the definition of 
RLL-constrained codes. 

Definition 3. Let /3,p be two positive integers. A binary vector 
u satisfies the (/3 , p) -window-weight-limited (WWL) con- 
straint if for any j3 consecutive cells there are at most p l's 
and u is called a (/3,p)-WWL vector. We denote the capacity 
of the constraint by Cwwl{(3,p)- 

Note that for p = 1, the (f3, 1)-WWL constraint is the (f3 - 
l,oo)-RLL constraint. According to TheoremQ] C(l,/3,p) is 
upper bounded by the capacity of the (/3,p)-WWL constraint, 
Cwwl(I3,p). Thus, we are interested in finding the capac- 
ity of this constraint. The approach is similar to the one used 
in 11251 in order to find an upper bound on the capacity of the 
checkerboard constraint. 

Definition 4. A merge of two vectors u and v of the same 
length n is a function: 

/„:{0,l}"x{0,ir^{0,l}" +1 U{F}. 

If the last n — l bits ofu are the same as the first n—1 bits of 
v, the vector f n (u, v) is the vector it concatenated with the last 
bit ofv, otherwise f n (u, v) = F. 

Definition 5. Let f3, p be two positive integers. Let Sp lP denote 
the set of all vectors of length /3 — 1 having at mostp 1 's. That 
is, Sp.p = {s £ {0,1] 



1 1 : wt(s) ^ p). The size of the set 
Sp, p isM = Yn=o i^ 1 )- Let si, s 2 , .. .,s M bean ordering of 
the vectors in Sp }P . The transition matrix for the (j3,p)-WWL 
constraint, Ap, v = (a^j) £ {0, l} MxM is defined as follows: 

_ / 1 if fp-i(si, sj) ^Fandvjt(fp-i(8i,8j)) < p, 
otherwise. 

Example 1. The following illustrates the construction of the 
^4/3=3, p =2 transition matrix. Note that 

S 3 ,2 = {ax, s 2 ,s 3 , Si } = {(0, 0), (0, 1), (1, 0), (1, 1)}, 
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The merge of Sj and Sj for i,j — 1,2,3,4 determines the 
matrix A^^- For example, ^(^l, Si) = (0,0,0), a±,i = 1; 
f 2 (s 2 , Sl )'=F, 02,i = 0; / a («i,s a ) = (0,0,1), ai, a '= 1 ^ 
02,i. This shows that the matrix is not necessarily symmetric. 
Finally, f 2 {s 37 s 3 ) = (1,1,1), and a 3 , 3 = since (1,1,1) 
does not satisfy the (3,2)-WWL constraint. 



A 



3,2 



^ 1 1 ^ 

11 
110 
\ 1 / 



Definition 6. A matrix A e {0, l} AIxAI is irreducible if for all 
1 ^ i,j ^ M there exists some n ^ such that (A")ij > 0. 
Note that n can be a function ofi and j. 

Lemma 1. For positive integers j3,p, the transition matrix A Pt p 
is irreducible. 

Proof of Lemma^ In our construction of Ap tP , it is pos- 
sible to show that (Ag p )ij is the number of vectors of length 
n + (3 — 1 starting in Si, ending in Sj and satisfying (j3,p)- 
WWL constraint, where Sj and Sj are defined in Definition [5] 
Therefore, A$, p is irreducible if such a vector of length n 1 
exists such that it starts with Si and ends in Sj, for every pair 
of Clearly it exists since adding zeros between s, and 

Sj gives such a valid vector. This proves the irreducibility of 

Ap tP . m 

The next theorem is a special case of Theorem 3.9 in lfl9l . 

Theorem2. The capacity of the ((3 , p)-WWL constraint is 

Cwwl{/3,p) = log 2 (A mo x), 
where X max is the largest real eigenvalue of Ap yP . 

Proof: See Theorem 3.9 in ED. ■ 
Fig. Q] shows Cwwl{/3,p), which is the upper bound of 
C(l, f3,p), for p — 1, 2, 3, 4 respectively. 
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Fig. 1. Upper bound on C(l,/3,p) 



Remark 1. There is a way of presenting the (/3,p)-WWL con- 
straint using labeled graphs (state transition diagrams). We 



present an example of the labeled graph (transition diagram) 
for the (f} = 7,p= 2)-WWL constraint in Fig. |2] An (j3,p)- 
WWL vector can be generated by reading off the labels along 
paths in the graph and the sequences in the ellipses indicate 
the six most recent digits generated. 




Fig. 2. Labeled graphs presenting the (7,2)-WWL constraint 



Remark 2. According to Theorem Q] the capacity C(a, l,p) 
is also upper bounded by the capacity of the (a,p)-WWL 
constraint, Cwwl(&,p)- Jiang et al. [16] proposed an upper 
bound on the rate of an (a, 1, 1) -constrained code with fixed 
block length n and multiple cell levels. By numerical experi- 
ments, we find that their upper bound appears to converge to 
our upper bound for binary cells when n — > oo. 

IV. Lower Bound on the Capacity 

In this section, we give lower bounds on the capacity of the 
(a, /3,p)-constraint based upon specific code constructions. 
The first construction we give is a trivial one which achieves 
rate Then, we will show how to improve it for the cases 
(l,/3,p) and (a, l,p). In this section we assume that for all 
positive integers x,y the value of a;(mody) belongs to the 
group {1, . . . , y] via the correspondence — > y. 

The idea of Construction [T] is to partition the set of n cells 
into subblocks of size j3. Suppose p = (3(q — 1) + r, where 1 ^ 
q ^ a and 1 ^ r ^ /3. The encoding process has a period of a 
writes. On the first q — 1 writes, all cells in each subblock are 
programmed with no constraint. On the q-th write, the first r 
cells in each subblock are programmed with no constraint and 
the rest of the cells are not programmed (staying at level 0). 
On the (q + l)-st to the a-th write, no cells are programmed. 
The details of the construction are as follows. 

Construction 1 Let a, /3,p be positive integers. We construct 
an (a, (3, p) -constrained code C of length n as follows. To sim- 
plify the construction, we assume that /3\n. Let q = § , r = 
p(mod(3). For all i 1, on the i-th write, the encoder uses the 
following rules: 

• If 1 < i(moda) < q, n bits are written to the n cells. 
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• If i(moda) = q, rnj (3 bits are written in all cells Cj such 
that 1 ^ j (mod/?) ^ r. 

• If i(moda) > q, no information is written to the cells. 
The decoder is implemented in a very similar way. 

Example 2. Fig. [3] shows a typical writing sequence of an 
(a = 3, (3 = 3,p = 2) -constrained code of length 15 based on 
Construction Q] The z-th row corresponds to the cell-state vec- 
tor before the i-th write. The cells in the box in the i-th row 
are the only cells that can be programmed on the i-th write. It 
can be seen that the rate of the code is the ratio between the 
number of boxed cells and the total number of cells, which is 

2 
9' 



000 000 000 000 000 
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100 000 010 
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000 10 

000 1 



1 00 1 1 1 

100 110 10 



Fig. 3. A sequence of writes of a (3, 3, 2)-constrained code 

Theorem 3. The code C constructed in Construction^ is an 
(a, (3, p) -constrained code and its rate is R = 

Proof: We show that for all i ^ 1 and 1 ^ j ^ n — 
(3 + 1, the rewrite cost of the cells Cj, Cj+i, . . . , Cj+p-i over 
the writes i, i + 1, . . . , i + a — 1, is at most p. For all ^ 
k ^ a — 1 such that 1 ^ (i + fc)(moda) < x, all of the f3 
cells can be written and since there are x — 1 such values the 
rewrite cost on these writes is at most (x — 1) (3. For k, such 
that (i + fc)(moda) = x, at most y out of these (3 cells are 
programmed and therefore the rewrite cost is at most y. For all 
other values of k no other cells are programmed. Therefore, 
the total rewrite cost is at most 



(x-l)-P + y = 



— 1 I /3 + {p mod (3) = p. 



The total number of bits written on these a writes is pn//3 
and hence the rate of the code is 

pn//3 p 
an a/3 



R 



A. Space Constraint Improvement 

In this subsection, we improve the lower bound on C(l,/3,p) 
over that offered by the trivial construction. Let S„ (f3,p) be 
the set of all (/3,p)-WWL vectors of length n. We define a 
(/3,p)-WWL code Cwwl of length n as a subset of S n (f3,p). 
If the size of the code Cwwl is M, then it is specified by an 



encoding map £wwl :{!,-•■, M} i-» Cwwl and a decoding 
map Vwwl ■ Cwwl {1, . . . , M}, such that for all m 6 
{1,...,M}, V W WL(£wwL{m)) = to. 

The problem of finding (/3,p)-WWL codes that achieve 
the capacity Cwwl(P,p) is of independent interest and we 
address it next. Cover (9) provided an enumerative scheme 
to calculate the lexicographic order of any sequence in the 
constrained system. For the special case of p = 1, corre- 
sponding to RLL block codes, Datta and McLaughlin iflOl . 
ifTTl proposed enumerative methods for binary (d, fc)-RLL 
codes based on permutation codes. For (/3,p)-WWL codes, 
we find enumerative encoding and decoding strategies with 
linear complexity enumerating all (/3,p)-WWL vectors. We 
present the coding schemes and the complexity analysis in 
Appendix lAl In the sequel, we will simply assume that there 
exist such codes with rate arbitrarily close to the capacity 
as the block length goes to infinity for all positive integers 
p and (3. The next construction uses (/3,p)-WWL codes to 
construct (1, /3,p)-constrained codes. 

Construction 2 Let (3,p be positive integers such thatp ^ f3. 
Let Cwwl be a ((3,p)-WWL code of length n' and size M. 
Let £wwl and V W wl be its encoding and decoding maps. A 
(1, f3,p) -constrained code Ci,^, p of length n = 2n' + j3 — 1 
and its encoding map £ and decoding map V are constructed as 
follows. 

1) The encoding map £ : {1,...,M} x {0, 1}" {0, 1}" 
is defined for all (to, u) E {1, . . . , M} x {0, 1}" to be 
£((m, it)) = v, where 
a) v T / = u 7 l' + £ W WL(m). 



2) 



t_\ n -\-p — i f. 

b) tv|i =0, 

C ) <> + l3 = 

The decoding map T> : {0, l} n 
for all u G {0,1}™ to be 



{!,..., M} is defined 



V(u)=V W wl{v\ +< 



Example 3. Here is an example of an (a = l,/3 = 3,p = 2) 
code with n' = 4 for the first 4 writes. The message set has 
size M n > = 13 (See the definition of M n i in Definition |7). The 
length of the memory is 2n'+/3— 1 = 10. Suppose on the sec- 
ond write, the message is in = 7. Since lexicographically the 
seventh element in £4(3,2) is (0110), the encoder will copy 
the previous left block (1011) to the right block and flip the 
second and the third bits in the left block (1011) ->• (1101). 







































1st write, 


m 


= 11 : 


1 





1 


1 




















2nd write, 


m 


= 7 : 


1 


1 





1 








1 





1 


1 


3rd write, 


m 


= 13 : 




















1 


1 





1 


4th write, 


HI 


= 4 : 








1 


1 





















Theorem 4. The code Ci l( 3. p is a (1, f3,p) -constrained code. If 
the rate of the code Cwwl is Rwwl, then the rate of the code 
Ci,/3,p is 2n'+p-i ' Rwwl- Both the encoder and decoder of 
Ci,/3, P have complexity 0(n). 

Proof: Let u be the cell-state vector in Construction [2] 
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1) For it™ , encoder step a) guarantees that the positions of 
rewritten cells satisfy (/3,p)-WWL constraint. So there 
are at most p reprogrammed cells in any j3 consecutive 
cells in u™ . 

2) For u™,_i_p, three consecutive writes should be exam- 
ined. Let w,v,ube the cell-state vectors before the i-th, 
(i + l)-st, (i+2)-nd writes, i ^ 1. Encoder step a) means 
that = w\ + £wwL(rm), where ra, e {1,..., M} 
is the message to encode on the i-th write. Since encoder 
step c) guarantees that v™, +j3 = to" and u™, +/3 = t> x , 
we have u™, +/3 = v™, + p+£wWL(mi)- This proves that 
u™, + g satisfies the (1,(3, p) constraint. 

3) For it™/+ x _1 , the cell levels are always set to be 0, which 
ensures that no violation of the constraint happens be- 
tween u" and w™/ +/3 . 

On each write, one of M messages is encoded as a vector 
of length n. Hence, the rate is log ^ M — (^ log J 2 l , M 2n'+p-i ) = 

2n'lp-l - R WWL- 

The encoder £ and decoder T> come directly from £wwl 
and T>wwl, which have complexity 0(n) both in time and 
in space. Therefore, £ and T> both have linear complexity in 
time and in space. ■ 

Corollary 1. Let j3, p be two positive integers such thatp ^ /3, 
then 

nn o \ \ (C W wl{I3,p) p\ 
C(l,(3,p) > max | , -j . 

CorollaryQjprovides a lower bound that is achieved by prac- 
tical coding schemes. In fact, following similar proofs in |4|, 
JTJ, 0, we can prove the following theorem using probabilis- 
tic combinatorial tools J5)- 

Theorem 5. Let /3, p be positive integers such that (3 ^ p, then 

C(l,0,p) = Cwwz.W,p), 

where Cwwl((3,p) is the capacity of the (f3,p)-WWL con- 
straint. 

Proof: See Appendix iBl ■ 
B. Time Constraint Improvement 

Jiang et al. constructed in lfl6ll an (a, 1, 1) -constrained code. 
Let us explain their construction as it serves as the basis for 
our construction. Their construction uses Write-Once Memory 
(WOM)-codes [[221. A WOM is a storage device consisting of 
cells that can be used to store any of q values. In the binary 
case, each cell can be irreversibly changed from state to 
state 1. We denote by [n, t; 2 nRl , 2 nRt ] a t- write WOM 
code Cw such that the number of messages that can be writ- 
ten to the memory on its i-th write is 2 nRi , and the sum-rate 
of the WOM code is defined to be i? S um = J2t=i R i- The 
sum-capacity C SU m is defined as the supremum of achievable 
sum-rates. The code is specified by t pairs of encoding and 
decoding maps, (£i,T>i), where i £ {1,2, ... ,t}. Assuming 



that the cell-state vector before the i-th write is c,, the encoder 
is a map 

Si : [1 : 2 nRi ] x {0, 1}" -> {0, 1}", 
such that for all (m,c£_ 1)X ) £ [1 : 2 nR '] x {0, 1}™, 

where the relation is defined in Definition [7] The decoder 

T>i : {0,1}™ -> [1 : 2 nRi ], 

satisfies 

£>t(£i(m,c"_ Xil )) = m. 

for all m £ [1 : 2 nR -}, 

It has been shown in |fl5l that the sum-capacity of a t-write 
WOM is aum = log 2 (i+l). 

The constructed (a, 1, 1) -constrained code has a period of 
2(t + a) writes. On the first t writes of each period, the en- 
coder simply writes the information using the encoding maps 
of the t-write WOM code. Then, on the (f + l)-st write no 
information is written but all the cells are increased to level 
one. On the following a — 1 writes no information is written 
and the cells do not change their levels; that completes half 
of the period. On the next t writes the same WOM code is 
again used; however since now all the cells are in level one, 
the complement of the cell-state vector is written to the mem- 
ory on each write. On the next write no information is written 
and the cells are reduced to level zero. In the last a — 1 writes 
no information is written and the cells do not change their 
values. We present this construction now in detail. 

Construction 3 Let a be a positive integer and let Cw be an 

[n, t; 2 nRl 2 nRt ] t-write WOM code. Let £ t (m, u i _ x ) be 
the i-th encoder of C w , for m £ [1 : 2 nR >],i £ [1 : t]. An 
(a, 1, 1) -constrained code C QiX .i is constructed as follows. For 
alii ^ 1, leti' = i(mod(2(t + a))), where 1 s$ i' < 2(t + a). 
The cell-state vector after the i-th write is denoted by C;. On the 
i-th write, the encoder uses the following rules: 
. Ifi' £ [1 : t], write M v £ [1 : 2 nR '} such that 

ci = £i>(Mi>,Ci-i). 

• If %' = t + 1, no information is written and the cell-state 
vector is changed to the all-one vector 1, i.e., Ci = 1. 

• If i' £ [t + 2 : t + a] , no information is written and the 
cell-state vector is not changed. 

. Ifi' £ [t+a+1 : 2t+a], write M v ^ t -a G [1 = 2 nfl >'-*-°] 
such that 

Ci £%' — t— ccy^^i' — t— oti ^i— l)* 

• If i' = 2t + a + 1, no information is written and the cell- 
state vector is changed to the all-zero vector 0, i.e., C\ = 
0. 

• If i' £ [2t + a + 1 : 2(t + a)], no information is written 
and the cell-state vector is not changed. 

Remark 3. This construction is presented differently in lfl6l . 
This results from the constraint of having the same rate on 
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each write which we can bypass in this work. Consequently, 
in our case we can have varying rates and thus the code C a ,i, P 
can achieve a higher rate. 

Theorem 6. The code C a ,i,i is an (a, 1, 1) -constrained code. If 
the t-write WOM code Cw is sum-rate optimal, then the rate of 

C Q ,i,i is 



log 2 (t+l) 

t + Q 



Proof: In every period of 2(t+a) writes, every cell is pro- 
grammed at most twice; once in the first t + 1 writes and once 
in the first t + 1 writes of the second part of the write-period. 
After every sequence t + 1 writes, the cell is not programmed 
for a — 1 writes. Therefore the rewrite cost of every cell among 
a consecutive rewrites is at most 1, 

If the rate of the WOM code Cw is Rw then 2nRw bits 
are written in every period of 2(t + a) writes. Hence, the rate 
of C a n is 



2nR v 
2(t+a)n 



jq^j. If Cw is sum-rate optimal, the 



rate of C a i i is therefore log ?[ t+1) . I 
The next table shows the highest rates of (a, 1, 1) 
constrained codes based on Construction [3] for a = 4, ... ,8 



a 


4 


5 


6 


7 


8 


l/a 


0.25 


0.2 


0.167 


0.143 


0.125 


rate of C a ,i,i 


0.290 


0.256 


0.235 


0.216 


0.201 



Next, we would like to extend Construction [3] in order to 
construct (a, l,p) -constrained codes for all p ^ 2. For sim- 
plicity of the construction, we will assume that p is an even 
integer; and the required modification for odd values of p will 
be immediately clear. We choose t ^ 1 such that a ^ (p — l)t 
and the period of the code is a + t. On the first t writes of 
each period, the encoder uses the encoding map of the t-write 
WOM code. In the following t writes, it uses the bit-wise com- 
plement of a WOM code as in Construction [3] This procedure 
is repeated for | times; this completes the first tp writes in the 
period. On the (tp + l)-st write, no new information is writ- 
ten and the cell-state vector is changed to the all-zero vector. 
During the (tp + 2)-nd to (a + t)-th writes, no information 
is written and the cell-state vector is not changed. That com- 
pletes one period of a + t writes. 



• If i 1 G [pt + 2 : a + t], no information is written and the 
cell-state vector is not changed. 

Theorem 7. The code C a ,i,p is an (a, l,p) -constrained code. If 
the t-write WOM code Cw is sum-rate optimal, then the rate of 

(i ; c plog 2 (*+l) 

^a,l,p « a +t 

Proof: This is similar to the proof of Theorem |6l so we 
present here only a sketch of the proof. In every period of (a + 
t) writes, each cell is rewritten at most p times. In particular, 
the first rewrite happens before the (t + l)-st write. After that, 
the cell is rewritten at most p — 1 times until the (tp + l)-st 
write and then not programmed for a + t — (tp + 1) writes. 
Therefore, each cell is rewritten at most p times on a + t — 
(tp + 1) + (tp + 1) — t = a writes. This proves the validity of 
the code. 

If the rate of the WOM code Cw is Rw then pnRw bits are 
written during each period of a+t writes since the WOM code 
is used p times. Hence, the rate of C a ,i, v is 2{2+t)n ~ 



If that Cw is sum-rate optimal, the rate of C a ,i,p is plog2 ^ t+1 ^ 



2(a+t)r 



a+t 



a+t 



Remark 4. In Construction [4] we required that a ^ (p — l)t 



and, in particular, t ^ 



p-i 



, we can simply 



If t > 

use Construction [4] while taking a = (p — l)t, i.e., the period 
of writes is now pt and and we construct a ((p — l)t, l,p)- 
constrained code, which is also an (a, l,p) -constrained code. 
The rate of the code is Rw/t, where Rw is the rate of the 
WOM code C w - 

The next corollary provides lower bounds on C(a,l,p). 

Corollary 2. Leta,p be positive integer such thatp ^ a. Then, 

>l og 2 (t+l) log 2 (t* + l) p 
a + t ' t* ' a 

where 



C(a,l,p) ^ max 

t,t*ez + , 



1 s$ t sC 



p-1 



,t* 



p-1 



Construction 4 Let a,p, t be positive integers such that a ^ 
(p - l)t. Let C w be an [n, t; 2 nRl , . . . , 2 nRt ] t-write WOM 
code. Fori G [1 : t], let£i(m, i^-i) be its encoding map on the 
i-th write, wherein G [1 : 2 *]. An (a, l,p) -constrained code 
C a ,i,p is constructed as follows. For all i ^ 1, let i' = i(mod 
(a+t)), i" = i'( mod 2t) where (a+t), 1 ^ i" ^ 2t. 

The cell-state vector after the i-th write is denoted by Ci. On the 
i-th write, the encoder uses the following rules: 

. Hi' G [1 : pt] andi" G [1 : t], write M v , G [1 : 2 nR *"\ 
such that 

^ = ^-"(Mj'/jCi-i). 

. If i' G [1 : pt] andi" G [t + 1 : 2t], write M v ,_ t e [1 : 
2 nR i"-t] such that 

Ci = £ l n-t(M i n_ tl C i _l). 

• Ifi'=pt + l,no information is written and the cell-state 
vector is changed to 0, i.e., Ci = 0. 




5 10 15 20 25 



a 

Fig. 4. Lower bound on C(ce, l,p) 

Figure |4] shows the rates of (a, f3 = \,p) constrained codes 
obtained by selecting the best t for each pair of (a,p). In com- 
parison to the codes in Construction Q] whose rates are shown 
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by the dashed lines, our construction approximately doubles 
the rates. Our lower bounds achieve approximately 78% of the 
corresponding upper bounds on C(a,l,p). 



where Xi{n) is the number of (/3,p)-WWL vectors of length 
n that have the vector Sj as a prefix, where x T denotes the 
transpose of x. 



C. Time-Space Constraint Improvement 

In this section, we are interested in combining the improve- 
ments in time and in space to provide lower bounds on the 
capacity of (a, /3,p)-constraints. 



Theorem 8. For alla,j3,p positive integers, 

C(a,l,p) C(l,f3,p) 



Lemma 2. The vectors xp tPi „ + i, n ^ /3, satisfy the first-order 
recursion: 



C(a, /3,p) ^ max 



Proof: An (a, /3,p)-constrained code can be constructed 
in two ways. 

1) Let C be a (1, (3,p) -constrained code of rate R and 
length n. We construct a new code C with the same 
number of cells. New information is written to the 
memory on all i-th writes, where i = 1( mod a), simply 
by using the |"^]-th write of the code C. Then, the code 
C is an (a, -constrained code and its rate is R/a. 
Therefore, we conclude that C(a,(3,p) ^ C< * 1 f ■ 

2) Let C be an (a, l,p) -constrained code of rate R and 
length n. We construct a new code C for n(3 cells: 
(ci,C2, ■ • ■ ,Cnp). The code C uses the same encoding 
and decoding maps of the code C, while using only 
the n cells c, such that i = 1 (mod/3). Then, the code 
C' is an (a, /3,p)-constrained code and its rate is R//3. 
Therefore, we conclude that C(a,(3,p) ^ C ("^ 1 'P) _ 

The capacity must be greater than or equal to the maximum 
of the two lower bounds. ■ 

Appendix A 

In this section, we show an enumerative encoding and de- 
coding strategy with linear complexity for the set of (j3,p)- 
WWL vectors. 

Definition 7. Let X = {xi, . . . ,xn} be a set of distinct binary 
vectors, Xi G {0, 1}™, i = 1, . . . , N. Let i/j(x) denote the deci- 
mal representation of a vector x £ {0, 1}". Forx, y G {0, 1}", 
we say x < y (or x -< y) if and only if ip(x) ^ "0(2/) ( or 
ip(x) < ip(y)). The order of the element Xi in X is defined as: 

ord(xi) = \{j : Xj ^ x h 1 < j < N}\. 

Let {ci,...,Cm„} be an ordering of the elements in 
S n (/3,p), where M n = \S n (/3,p)\. The encoder and decoder 
of a (/3,p)-WWL code give a one-to-one mapping be- 
tween S n (j3,p) and {1, ...,M n }, namely S WWL (m) = c m 
where o(c m ) = m and V WWL (c m ) = o(c m ) = m, 
for all m = {1, ...,M n }, Now the problem is to calcu- 
late o(c m ) given c m . Let s±,...,smb-i be the ordering 
of the vectors in Sp p introduced in Definition [5] where 

= ispj = \s P U{fi,p)\ = ELo C 3 ; 1 )- Let 

xp,p,n = (xi(n),x 2 (n), . . .,% M (n)) T , 



Proo/: See |25|. ■ 
The encoder and decoder have access to a matrix Xp tP>n € 



1 , where the i-th row of Xp, p/H is x^ vi , 



1, ... ,n + p. For simplicity, Xp tP ^ n is written as X if no 
confusion can occur. We denote by X(i,j) the entry in the 
i-th row and j-th column of X and we define X(i, :), X(:,j) 
to be the i-th row vector, j-th column vector of X, re- 
spectively, i.e., X(i, :) = (X(i, 1), . . . , X(i, Mg_i)) and 
X(:,j) = (X(l,j),...,X(n + p,j)) T . FromLemmaEl 
Xp pn can be calculated efficiently with time complexity 
0{n). 

1) Decoder: Based on X^ p n , we present an enumerative 
method to calculate the order of each element in S n ((3,p). 
Note that the order of a vector is the decoded message corre- 
sponding to that vector. In this algorithm, the decoder scans 
the vector from left to right. Whenever the decoder finds a 1 
in the vector, the order of the vector will increase. The details 
of the algorithm are presented below. Here c = (ci, . . . , c n ) G 
S n (f3,p) is the binary vector to be decoded; the algorithm cal- 
culates o(c) G {1, . . . , Mn}. 

Algorithm 1 Decoding: Calculate o(c), c g S n ((3,p) 
1: letcnt = 0,j = l,i = 0; 
2: whiJe (i ^ n){ 
3: whiJe (j ^ n and c(j) ^ 1) 
4: j = j + 1; 

5: if(7=n + l) 
6: o(c) = cnf + 1; 

7: algorithm ends; 

8: } 

9: /*A 1 is detected in c. */ 

10: let d = (0, . . . , 0) with length /3-1; 

1 1 : /*d is a vector storing j3 — 2 bits to the left of the detected 
1, appended with a 0.*/ 

12: 

13: 

14: 

15: 

16: find k G [1 : Mp-i] such that s k = d; 
17: cat = cut + X(n — j + /3 — 1, k); 
18: i = j;j = i + l; 
19: } 

20: o(c) = cut + 1; 
2 1 : algorithm ends. 



if (j^ 13-1) 

letd ' 2 - c^ 1 ■ 
else /*j < (3 - 1 */ 

j/3-2 



J-l. 
-1 
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Example 4. Suppose we would like to decode a (6,3)-WWL 
vector c = (1011001001) of length 10. 

. A 1 is detected (1011001001), where i = 0,j = 1. 
The decoder aims to find the number of vectors c 
such that (0000000000) d c -< (1000000000). Now 
d = (00000) = s u so k = 1, and n - j + (3 - 1 = 14. 
Therefore, cnt = + X 6 ,3,i 6 (14, 1) = 236. 

. A 1 is detected (1011001001), where i = l,j = 3. 
The decoder aims to find the number of vectors c 
such that (1000000000) ^ c ^ (1010000000). Here 
d = (00100) = s 5 , so k = 5, and n - j + (3 - 1 = 12. 
Therefore, cnt = 236 + X 6j3 , 16 (12, 5) = 308. 

. A 1 is detected (1011001001), where i = 3,j = 4. 
The decoder aims to find the number of vectors c 
such that (1001000000) d c -< (1011000000). Here 
d = (01010) = s n , so k = 11, andn-j + /?-l = 11. 
Therefore, cnt = 308 + X 6 , 3 ,i 6 (ll, 11) = 343. 

. A 1 is detected (1011001001), where i = 4,j = 7. 
The decoder aims to find the number of vectors c 
such that (1011000000) ^ c -< (1011001000). Here 
d = (11000) = s 23 , so k = 23, and n - j + (3 - 1 = 8. 
Therefore, cnt = 343 + X 6 , 3 ,i 6 (8, 23) = 351. 

. Finally, a 1 is detected (1011001001), where i = 7,j = 
10. The decoder aims to find the number of vectors c 
such that (1011001000) d c -< (1011001001). Here d = 
(01000) = s 9 , so k = 9, and n-j+fi-l = 5. Therefore, 
cnt = 351 + X 6 ,3,i6(5, 9) = 352. 
We calculate that o(c) = cnt + 1 = 353 and c is decoded as 
353. 

Theorem 9. Algorithm\T\calculates the order of a (f3,p)-WWL 
vector of length n in S n ((3,p). Its time complexity and space 
complexity are both 0(n). 

Proof: 

Correctness: Let c be the vector to decode; that is, we seek 
to find ord(c). For Ci -< c 2 , we denote by N(c\, c 2 ) the num- 
ber of vectors c such that Ci < c -< c 2 . Let C\, . . . , cj, be a 
sequence of vectors such that = Cq ^ c.\ d: c 2 • • • d: 
cl = c; then it is easy to see 

L 

o(c) = 5^JV(c i _i,c i ) + l. 

i=l 

Let L be the number of l's in c; let all the indices of l's be 
ji , j2 , . . . , j'l in ascending order, that is 1 ^ j\ < ■ • • < Jl ^ 
n and Cj 1 = cj 2 = ■ ■ ■ = cj L = 1. For i G {1, . . . , L}, <\ is 
chosen such that c$ = c,-i + 5^, where Co = 0, and 8j, j G 
{1, . . . , n}, denotes the vector where all entries are except 
for the j-th entry, which is a 1. Here addition is component- 
wise modulo-2 summation. 

Lines 3 and 4 together with Line 18 in Algorithm [TJ scan c 
and find Cj according to C;_i. Therefore, we are left to prove 
that Algorithm Q] calculates N(ci-i, Cj) for i G {1, . . . , L}. 

By definition, the first ji — 1 digits of Cj and Cj_i are the 
same, and Cj ^ = 1 while Cj_i ^ = 0. Then a vector c G 
{0, 1}" satisfies Cj_i ^ c -< if and only if the first ji digits 
of c are the same as those of Cj_i, i.e. Cj* = c^ij x . Given 



the length and the first ji digits of c, the number of possible c 
can be calculated based on the matrix X in the following way. 
Since the (/?,p)-WWL constraint is local, if ji > f3 — 1, the 
task is equivalent to calculating the number of c with length 
n — + — 1 such that the first /3 — 1 digits are a prefix of 
c, in particular, c^ 1 = cj ! _ (3+2 ; otherwise, for ji ^ j3 — 1, it 
is equivalent to calculating the number of c with length n — 
ji + /3 — 1 such that the first j3 — 1 digits are zeros followed by 
length-ji prefix of c, that is, Cj _1 = (0^_i_j i; c^). Lines 10 
- 15 in Algorithm Q] find the first (3 — 1 digits of c and Lines 
16 and 17 calculate the number of c, which is the number of 
vectors c satisfying c»_i d c -< c.;. Therefore, Algorithm [TJ 
calculates iV(ci-i, Cj) for i G {1, . . . , £} and sums them up 
to derive the order of c. 

Time complexity analysis: It can be seen from the algo- 
rithm that the decoder scans the vector that is to be decoded 
only once. Whenever the decoder detects a 1, it uses binary 
searches to find the corresponding prefix vector d in X, while 
the number of l's is no more than Therefore, the time 
complexity of the decoder is no more that 0{^§- \ogMp_{) = 

0(^ log£Lo C 3 ^ 1 )) = where P and P are fixed in- 

tegers and not related to n. 

Space complexity analysis: The space complexity comes 
from the matrix X with n + [3 — 1 rows and Mp-i columns. 
Therefore, the space complexity is also 0(n) since (3 and 
Mp-i are both fixed integers. ■ 

2) Encoder: The encoder follows a similar approach to 
map an integer m G {1, . . . , M n } to a vector c G S„((3,p), 
such that o(c) = m. We call c the encoded vector for the 
message m. Note that Vto;, nij G {1, . . . , M n }, mi ^ rrij if 
and only if Ci d Cj, where o(ci) = rrii and o(cj) = rrij. 
The following encoding algorithm uses the matrix X to effi- 
ciently calculate the vector c G S n (/3,p) such that o(c) = m, 
for m G {1, . . . , M n }. The algorithm has linear complexity. 

Algorithm 2 Encoding: find c such that o(c) = m 
Jet cnt = 0, c = (0, . . . , 0) with length n; 

fori = 1, 2, . . . , n { 

Jet i = c; 

lett(i) = 1; 

if t satisfies (/3,p)-WWL constraint { 

let q = (0, . . . , 0) with length /3 - 1; 

/*q is a vector storing (3 — 2 bits to the left oft{i) in t, appended 
with a 0.*/ 

if {i^ 0-1) 

ieiq 1 — r i _ /3+2 , 

else /*i < (3 - 1 */ 

find k G [1 : Mp-i] such that Sfc = q. 

Jet CntTry = cnt + X(n - i + f3 - 1, k); 
if{CntTry + l = m) { 
c = t; 

return c; algorithm ends; 

} 
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if(CntTry + 1< to) { 
Jetc(i) = 1; 
let cnt = CntTry; 

} 

} 

} 

Example 5. Suppose we would like to encode one of M n = 
421 (fi = Q,p = 3)-WWL vectors of length n = 10. The 
message to be encoded is to = 353. 

. c = (0000000000), i = 1, t = (1000000000), 

q = (00000) = si, so k = 1. Since cnt = 0, 

CntTry = cnt + X(n - i + p - 1, fc) = 236 < to - 1, 

so set cnt = 236. 
. c = (1000000000), i = 2, t = (1100000000), 

q = (00010) = s 3 , so k = 3. Compute CntTry = cnt + 

X(n - i + - 1, k) = 236 + X(13, 3) = 355 > to - 1. 
. c = (1000000000), i = 3, t = (1010000000), 
= (00100) = s 5 , so k = 5. Compute CntTry = cnt + 

X(n - i + /3 - 1, fc) = 236 + X(12, 5) = 308 < m - 1, 

so set cnt = 308. 
. c = (1010000000), i = 4, t = (1011000000), 

q = (01010) = s u , so fc = 11. Compute CntTry = 

cnt + X(n - i + P - 1, fc) = 308 + X(ll, 11) = 343 < 

?7i — 1, so set cnt = 343. 
. c = (1011000000), i = 5, t = (1011100000) does not 

satisfy (6,3)-WWL constraint. 
. c = (1011000000), i = 6, t = (1011010000) does not 

satisfy (6,3)-WWL constraint. 
. c = (1011000000), i = 7, t = (1011001000), 

q = (11000) = s 23 , so fc = 23. Compute CntTry = 

cnt + X(n - i + P - 1, fc) = 343 + X(8, 23) = 351 < 

??i — 1, so set cnt = 351. 
. c = (1011001000), i = 8, i = (1011001100) does not 

satisfy (6, 3)-WWL constraint. 
. c = (1011001000), i = 9, t = (1011001010), 

q = (00100) = s 5 , so fc = 5. Compute CntTry = 

cnt+X(n-i+P-l, k) = 351+X(6, 5) = 353 > m-1. 
. c = (1011001000), i = 10, t = (1011001001), q = 

(01000) = s 9 , so k = 9. Compute CntTry = cnt + 

X(n - i + P - 1, fc) = 351 + X(5, 9) = 352 = m - 1. 

Therefore, c = t= (1011001001) and o(c) = 353. 

Theorem 10. Algorithm^encodes a message m £ { 1, . . . , M ra } 

to a (/3,p)-WWL vector c e S n (p,p) such that o(c) = to, a/id 
its time complexity and space complexity are both 0(n). 

Proof: 

Correctness: The proof of the correctness of the encoder is 
similar to the proof of the correctness of the decoder. There- 
fore, we omit the details. 

Time complexity analysis: It can be seen from the algo- 
rithm that the encoder scans the vector from left to right once 
and tries to set each entry to 1. Whenever the encoder sets 
an entry to 1, it first determines whether the constraint is sat- 
isfied. This takes 0(1) steps since we do not have to check 
the entire vector but only the p bits to the left of the set 



entry. Then it uses binary search to find the corresponding 
prefix vector in X, while the number of l's is no more than 
Therefore, the complexity of the encoder is no more that 

0(f logM^x) = 0(f logELo (Y)) = 0(n), where p 
and P are fixed numbers. 

Space complexity analysis: The matrix X is the primary 
contributor to the space complexity. As is shown in the proof 
of Theorem [9] the space complexity is also 0(77). ■ 

Note that Algorithm [2] and Algorithm Q] establish a one-to- 
one mapping between {l,...,M n } and S n (p,p). Therefore 
the rate of the encoder is maximized. If the blocklength goes 
to infinity, the rate of the encoder approaches Cwwl(P,p)- 

Appendix B 

In this section, we present the proof of Theorem [5] The 
reason for which the proof of Theorem is non-trivial is the 
following. Suppose the cell-level vector is updated from c,_i 
to Ci on the i-th write. The encoder has full knowledge of 
Ci-x and Cj since we assume there is no noise in the updat- 
ing procedure. The decoder is required to recover Cj + c,_i 
with full knowledge of Ci but zero knowledge of Cj_x. This 
is similar to the work on memories with defects in 0, where 
the most interesting scenario is when the defect locations are 
available to the encoder but not to the decoder. In general, it 
can be modeled as a channel with states JT] where the side 
information on states is only available to the encoder. 

Proof: First we introduce some definitions. Recall that 
S n (P,p) is defined as the set all (P,p)-WWL vectors of length 
77. S n {P,p) will be written as 5 for short if no confusion 
about the parameters can occur. Let V n = {0, 1}" be the 77- 
dimensional binary vector space. 

Definition 8. For a vector x e V n and asetS C V n , we define 
S + x = {s + x\s G S} and denote it by S(x). We call vectors 
in S(x) reachable by x and we say S(x) is centered at x. 

For two subsets Bi, B2 C V n , we define B\ + B2 = {bi + 
b 2 \bi e Bi,b 2 e B 2 }. We call a subset B c V n S-goodif 

S + B= (J S(b) = V n , 

beB 

i.e., V n is covered by the the union of translates of S centered 
at vectors in B. 

Lemma 3. If B c V„ is S-good, then t + B is S-good, Vt € 
V n . 

Lemma 4. If B c V n is S-good, then Vx e V n , 3b e B, 3s e 
S, such that x + s = b. 

Lemma [4] guarantees that if B C V n is an £-good subset, 
then from any cell-state vector x, there exists a (P,p)-WWL 
vector s, such that x + s £ B. We skip the proofs of Lemma [3] 
and 2] referring the reader to similar results and their proofs 
in Q. 

Lemma 5. IfGi, . . . , Cm are pairwise disjoint S-good subsets 
ofV n , then there exists a (1, P,p) -constrained code of size M. 
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In particular, if G is an S-good (n, k) linear code, then there 
exists a (1, j3, p) -constrained code with rate 

Proof: If Gi is <S-good for all i £ [1 : M], then from 
LemmaHl for any x £ V n and i £ [1 : M], there exist g i £ Gi 
and Si £ 5, such that x + Si = g { . Suppose the current cell- 
state vector is x, then we can encode the message i £ [1 : M] 
as a vector £{i,x) = x + Si £ Gi, for some Sj £ 5. The 
decoder uses the mapping 2?(cc) = i, if x € Gi, to give an 
estimate of i £ [1 : M\. This yields a (1, /?,p)-constrained 
code of size M. 

If Gi, . . . , G 2 n-i represent the cosets of an <S-good (n, k) 
linear code G, then each coset is 5-good according to 
Lemma [3] The rate of the resulting (1, f3,p) -constrained code 
• io g2 (2"- fc ) _ n-k _ 

n n 

Now we are ready to prove Theorem [5] 

Let Bj be a randomly chosen (n,j) linear code with 2 J 
codewords (Bq = {0}), and let m B . = \V n /{Bj + S)\ be 
the number of vectors not reachable from any vector in Bj. 
Let x £ V n be a randomly chosen vector and let Q B . be the 
probability that x £ Bj + S. Then we have 

m B] =2 n Q Bj . 

The proof of the following lemma is based upon ideas dis- 
cussed in flU pp. 201-202]. 

Lemma 6. There exists a linear code Bj such that 

Proof: Let Bj = {y 1 , . . . ,j/ 2 j} denote an (n,j) linear 
code. If 



then 



Qi 



B,+S, 



1 - 2~ n N B < 



where N Bj = \S Bj \. 

Let z ^ and let Bj+i <z be the + 1) linear code 
formed by [z + Bj) U Bj. It can be seen Bj+i. z comprises 
the 2 J vectors in Bj plus 2 J new vectors of the form z + y, 
y £ Bj. Let 

It can be seen that S B . z has the same cardinality as S B .. 
Therefore, it contains N B . vectors, too, some of which may 
already belong to S Bj . Since S Bj+l x = S Bj LiS B z , we have 

N Bi+liM =2N Bj -\S Bi nS% jilt \. 
Thus N Bj+1 K is maximized by choosing z that minimizes 

\s B] n s* BjZ \. 

Let us now calculate the average of \S Bj D S g . z \ over all 
z £ V„. Here all z £ are also considered since they will 



result in an overestimate of the average of \S B , <~]S B . z \. Then 

E h 



E l^n^ 



E j 



E^i 



@ 



© 



E E i 

x&Sbj z&x+SBj 

E ^ 

= N Bj , 

where 1a is the indicator function of the event A, i.e., 1a = 1 
if A is true and 1a = otherwise. 

Equality © holds since, for a fixed x, if 2 £ x + S B . , then 
x <E S B . z and vice versa. Equality © holds since + | = 
ISbJ = ^Vbj- Thus, the average value of \S Bj n 5^. 2 | is 
2~ n N B .. Since the minimum of H5|j. z \ cannot exceed 
this average, we conclude that there exists z £ V n , such that 
\S Bj r\S B . J < N B .. Then there exists B j+1 , such that 



N Bj+1 > 2N Bj - 2 n N 2 Bj 



Thus, 



Qb ]+1 = 1 - 2- n N Bj+1 

< l-2- n (2AT B . -2" n AT|.) 
= (l-2~ n N Bj ) 2 

= Q%- 

It follows that there exists Bj, such that Q Bj ^ Qfj . ■ 

Lemma 7. 7f j n — log |<S| + log n, then there exists Bj such 
thatm Bj < 1. 

Proof: Note that Q Bo = 1 - 2~ n • N Bo ^ 1 - 2"" • 
Then there exists i?j, such that 

Qb 3 < Q 2 b 

«C (l-2-"|5|) 2J 
^(l-2-"|S|) 2 "~ log|SI+I ° E " 
= (l-2-"|5|) 2 " l<sr1 -" 

< e _n < 2~ n . 

Then m B] = 2 n Q Bj < 1. ■ 

Since is an integer and mg. < 1, there exists an (n,j) 
linear code such that m Bj = 0, i.e., an 5-good Bj exists. 
According to Lemma [5] there exists a sequence of (l,/3,p)- 
constrained codes of length n and rate R n (l, (3,p) such that 

n — (n — log |«S| + log n) 



supi?„(l,/3,p) ^ lim 



= lim 



lim 



log 1^1 -logn 
n 

log |5| 



n— yoo n 

Cwwl (/3,p)- 
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We have seen in Theorem Q] that C(l,f3,p) < Cwwl{P,p). 
This concludes the proof that C(l,f3,p) = Cwwl(P,p) ■ 
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